Support the merge of decoder without/with past for encoder-decoder models in the ONNX export #926

fxmarty · 2023-03-27T11:15:15Z

As per title, will need #924 to be merged first.

Next PR: support this in ORTModel as well.

HuggingFaceDocBuilderDev · 2023-03-27T11:38:53Z

The documentation is not available anymore as the PR was closed or merged.

michaelbenayoun

LGTM

michaelbenayoun · 2023-03-27T13:40:04Z

optimum/exporters/onnx/base.py

+        if self.is_merged is True and self.use_cache_branch is True:
+            reference_model_inputs["use_cache_branch"] = DummyInputGenerator.constant_tensor(shape=[1], value=True)
+        elif self.is_merged is True and self.use_cache_branch is False:
+            reference_model_inputs["use_cache_branch"] = DummyInputGenerator.constant_tensor(shape=[1], value=False)


Suggested change

if self.is_merged is True and self.use_cache_branch is True:

reference_model_inputs["use_cache_branch"] = DummyInputGenerator.constant_tensor(shape=[1], value=True)

elif self.is_merged is True and self.use_cache_branch is False:

reference_model_inputs["use_cache_branch"] = DummyInputGenerator.constant_tensor(shape=[1], value=False)

if self.is_merged:

reference_model_inputs["use_cache_branch"] = DummyInputGenerator.constant_tensor(shape=[1], value=self.use_cache_branch)

edit: actually this is less explicit

optimum/exporters/onnx/base.py

fxmarty added 3 commits March 27, 2023 13:14

add support in the export

7977917

add tests

6d6d27d

Merge branch 'master' into support-encoder-decoder-merge-in-onnx-export

8a76bef

fix

3bcc4cd

fxmarty requested review from michaelbenayoun, JingyaHuang, regisss and mht-sharma and removed request for michaelbenayoun and JingyaHuang March 27, 2023 12:14

fxmarty mentioned this pull request Mar 27, 2023

Regression: merge_decoders fails in 1.7.3 #921

Closed

4 tasks

michaelbenayoun approved these changes Mar 27, 2023

View reviewed changes

fxmarty force-pushed the support-encoder-decoder-merge-in-onnx-export branch from 9005998 to 8cede08 Compare March 28, 2023 08:20

fix tests

afb4220

fxmarty force-pushed the support-encoder-decoder-merge-in-onnx-export branch from 8cede08 to afb4220 Compare March 28, 2023 08:21

remove print

df7cf2c

fxmarty merged commit 0c3713d into huggingface:main Mar 28, 2023

This was referenced Mar 30, 2023

Uncaught (in promise) Error: failed to call OrtRun(). error code = 6 huggingface/transformers.js#54

Closed

[Model request] Helsinki-NLP/opus-mt-ru-en (marian) huggingface/transformers.js#63

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support the merge of decoder without/with past for encoder-decoder models in the ONNX export #926

Support the merge of decoder without/with past for encoder-decoder models in the ONNX export #926

fxmarty commented Mar 27, 2023

HuggingFaceDocBuilderDev commented Mar 27, 2023 •

edited

Loading

michaelbenayoun left a comment

michaelbenayoun Mar 27, 2023

fxmarty Mar 27, 2023 •

edited

Loading

Support the merge of decoder without/with past for encoder-decoder models in the ONNX export #926

Support the merge of decoder without/with past for encoder-decoder models in the ONNX export #926

Conversation

fxmarty commented Mar 27, 2023

HuggingFaceDocBuilderDev commented Mar 27, 2023 • edited Loading

michaelbenayoun left a comment

Choose a reason for hiding this comment

michaelbenayoun Mar 27, 2023

Choose a reason for hiding this comment

fxmarty Mar 27, 2023 • edited Loading

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented Mar 27, 2023 •

edited

Loading

fxmarty Mar 27, 2023 •

edited

Loading